Overview of the Patent Machine Translation Task at the NTCIR-10 Workshop
نویسندگان
چکیده
This paper gives an overview of the Patent Machine Translation Task (PatentMT) at NTCIR-10 by describing its evaluation methods, test collections, and evaluation results. We organized three patent machine translation subtasks: Chinese to English, Japanese to English, and English to Japanese. For these subtasks, we provided large-scale test collections, including training data, development data and test data. In total, 21 research groups participated and 144 runs were submitted. We performed four types of evaluations: Intrinsic Evaluation (IE), Patent Examination Evaluation (PEE), Chronological Evaluation (ChE), and Multilingual Evaluation (ME). We conducted human evaluations for IE and PEE.
منابع مشابه
Overview of the Patent Machine Translation Task at the NTCIR-9 Workshop
This paper gives an overview of the Patent Machine Translation Task (PatentMT) at NTCIR-9 by describing the test collection, evaluation methods, and evaluation results. We organized three patent machine translation subtasks: Chinese to English, Japanese to English, and English to Japanese. For these subtasks, we provided large-scale test collections, including training data, development data an...
متن کاملOverview of the Patent Translation Task at the NTCIR-7 Workshop
To aid research and development in machine translation, we have produced a test collection for Japanese/English machine translation and performed the Patent Translation Task at the Seventh NTCIR Workshop. To obtain a parallel corpus, we extracted patent documents for the same or related inventions published in Japan and the United States. Our test collection includes approximately 2 000 000 sen...
متن کاملOverview of the Eighth NTCIR Workshop
For the Eighth NTCIR Workshop (NTCIR-8), we selected and organized seven research areas as “tasks” to investigate, test and benchmark newly constructed test collections. These areas are Complex Cross-Lingual Question Answering (CCLQA), Information Retrieval for Question Answering (IR4QA), Geographic and Temporal Search (GeoTime), Multilingual Opinion Analysis (MOAT), Patent Machine Translation ...
متن کاملISTIC Statistical Machine Translation System for Patent machine translation in NTCIR-9
This paper describes statistical machine translation system of ISTIC used in the evaluation campaign of the patent machine translation task at NTCIR-9. In this year's evaluation, we participated in patent machine translation task for ChineseEnglish. Here we mainly describe the overview of the system, the primary modules, the key techniques and the evaluation results.
متن کاملOverview of the Patent Translation Task at the NTCIR-8 Workshop
Motivation Motivation l / • Systematic evaluation in NLP / IR is crucial – Large Test Collections are needed • We have produced test collections for p patent retrieval at NTCIR since 2001 – What is the next task for patent What is the next task for patent information? 2 History of Patent IR at NTCIR y • NTCIR-3 (2001-2002) T h l 2 years of JPO patent li ti * – Technology survey • Applied conven...
متن کامل